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Normally transcriptionally silent genes in a cell line or microorganism may be activated for expression by inserting a pNA 
regulatory element which is capable of promoting the expression of a normally expressed gene product in that cell or which is 
promiscuous, the regulatory element being inserted so as to be operatively linked with the normally silent gene in question: The 
insertion is accomplished by means of homologous recombination by creating a DNA construct including a segment having a 
DNA segment of the normally silent gene (targeting DNA) and the DNA regulatory element to induce gene transcription. The 
technique is also used to modify the expression characteristics of any endogenous gene of a given cell line or microorganism. 
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Endogenous gene expression modification with regulatory 
element. 

5 VTKT.D OF TNVFNTION 

The present invention relates to a process for 
the modification of the expression characteristics of a 
gene which is naturally present within the genome of a 
stable cell line or cloned microorganism. In the preferred 

10 embodiment, the present invention relates to a process for 
the activation and expression of a gene that is present 
within a stable cell line and normally transcriptionally 
silent or inert. As a result, the protein product of that 
gene is expressed. This phenomenon occurs without 

15 transf ecting the cell with the DNA that encodes the 

product. Rather, the resident gene coding for the desired 
product is identified within a cell and activated by 
inserting an appropriate regulatory segment through a 
technique called homologous recombination. Positive and/or 

20 negative selectable markers can also be inserted to aid in 
selection of the cells in which proper homologous 
recombination events have occurred. As an additional 
embodiment, a specified gene can be amplified for enhanced 
expression rates, whether that gene' is normally 

25 transcriptionally silent and has been activated by means of 
the present invention, or endogenously expresses product. 

BACKGROUND OF THE TNVENTION 

It is well known that each cell within an 
organism contains the genetic information that encodes all 
of the proteins found within that organism. However, only 
a very small percentage of the genes present within a given 
cell type is actually transcribed. The intracellular 
mechanisms that regulate the array of genes to be 
transcribed are now understood. Cell specific proteins 
present within the nucleus interact with DNA regulatory 
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Segments that are linked with particular genes. This 
interaction of nuclear proteins with DNA regulatory 
sequences is required for gene transcription. This results 
in mRNA biosynthesis and ultimate expression of the encoded 
5 protein (Mitchell and Tjian, Science . 245:371,1989). 

These DNA regulatory segments or elements for 
each gene lie upstream from and, in some cases, within or 
even downstream of the coding regions. Through an 
interaction with cell specific nuclear proteins, DNA 
10 regulatory segments affect the ability of RNA polymerase, 
the rate limiting enzyme in protein expression, to gain 
access to the body of the gene and synthesize a mRNA 
transcript. Thus, these DNA segments and the resident 
nuclear proteins play a critical role in the regulation of 
15 expression of specific genes (Johnson and McKnight, Ann. 
Rev. Biochem. . 58:799, 1989) . 

The DNA regulatory segments are binding sites for 
the nuclear proteins. These nuclear proteins attach to the 
DNA helix and apparently alter its structure to make the 
desired gene available for RNA polymerase recognition, 
which facilitates gene transcription. The expression of 
these cell specific regulatory proteins determines which 
genes will be transcribed within a cell and the rate at 
which this expression will occur. As an example" of the 
specificity of this system, pituitary cells but not liver 
cells express pituitary proteins, even though the genes for 
the pituitary proteins are present within all liver cells. 
Nuclei of the liver cells do not contain the specific DNA 
binding proteins which interact with the elements of 
pituitary genes resident within the liver cells. 

Current Methods Empl oyed to Express Proteins Using 
Recombinant DNA Technology 

With the knowledge that specific DNA regulatory 
35 sequences are required to activate gene transcription 

within a particular cell type, scientists have expressed 
foreign genes within a particular cell type through genetic 
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engineering. In general, DNA regulatory segments that are 
recognized by the cell's nuclear proteins are placed 
upstream from the coding region of a foreign gene to be 
expressed. In this way, after insertion into the cell, 
5 foreign DNA may be expressed since the cell's nuclear 
regulatory proteins now recognize these DNA regulatory 
sequences. This technology has been employed to produce 
proteins that have been difficult to obtain or purify from 
natural sources by traditional purification strategies. 

10 In addition to the recognizable DNA sequences and 

the gene of interest, a selectable marker is attached to 
the DNA construction. In this way, only the cells that 
have taken up the DNA survive following culture in a 
selectable medium. For example, the gene for neomycin 

15 resistance may be included in the expression vector. 

Following transf ection, cells are cultured in G418, a 
neomycin antibiotic that is lethal to mammalian cells. If 
however, the cells have acquired the neomycin resistance 
gene, they will be able to withstand the toxic effects of 

20 the drug. In this way, only the cells that have taken up 
the transf ected DNA are maintained in culture. It is 
understood that any selectable marker could be used as long 
as it provided for selection of cells that had taken up the 
transf ected DNA. It is further understood that there is no 

25 criticality as to the specific location of the inserted 

genetic material within the cell. It is only important that 
it be taken up somewhere within the nucleus as both the 
regulatory segment and the foreign gene (as well as the 
selectable marker) are inserted together. 



30 



n^ficienei^ in th f > Current Methods of Gene Expression 

While the above techniques have been instrumental 
in exploiting the power of genetic engineering, they have 
not always been the most efficient methods to express 
35 genes. This is due to the fact that insertion of DNA into 
the nucleus of a cell line is usually accomplished through 
a technique known as transf ection. DNA that has been 
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engineered for expression in the cell line of interest is 
precipitated and the cell membrane is solubilized to allow 
entry of the DNA. As indicated above, the exact site into 
which the DNA incorporates into the genome is never 
predictable; indeed the DNA may remain episomal (not 
integrated into the genome) . This results in the 
unpredictability of both the level of expression of the 
protein produced and the stability of the cell line. 

A second shortcoming of this technique is the 
fact that the construction of the expression vector is 
extremely difficult when the gene of interest is relatively 
large (greater than 5-10 kilobases) . Many of the proteins 
expressed by recombinant DNA technology have been encoded 
by cDNAs rather than much larger genomic clones. This is 
done to reduce the overall size of the insert, while the 
use of cDNAs makes genetic engineering more convenient, 
rates of gene transcription and protein production may 
suffer as a result. It has recently been shown that 
expression levels are sometimes greatly enhanced through 
the use of genomic rather than cDNA inserts (Brinster et 

al " Pr ° C - N3tl - * Cad - 85:836-840, 1988, and Chung 

and Perry, Mol. Cell, Biol. , 9:2 075- 2082, 1989). 

Although the mechanisms responsible for this observation 
are not well understood, it is known that in certain 
situations enhancer elements present within introns can 
improve the transcriptional efficiency of the gene. There 
xs also evidence that introns, or the splicing events which 
result from the presence of introns, may have an effect on 
the RNA processing events which follow the initiation of 
transcription (Buchman and Berg, Mol. c e n . , 8 . 4395 _ 

4405, 1988). This may stabilize the transcript thereby 
improving the rate of mRNA accumulation, in the above 
cited Brinster et al paper, it is also postulated that the 
position of the introns within the gene may be important 
for phasing of nucleosomes relative to the promoter. The 
influence of various regulatory elements on transcription 
of eukaryotic genes is discussed in Khoury et al, Cell 
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33:313-14 (1983), Maniatis et al, Science . 236:1237-45 

(1987) and Muller et al, Eur. J. Biochem. . 176:485-95 

(1988) • 

Thirdly, to gain entry into the nucleus, the 
5 transfected DNA, including the entire coding region of the 
foreign gene, must traverse the cytoplasm following entry 
through the permeabilized plasma membrane of the cell* 
During that time, the DNA may come in contact with 
lysosomal enzymes which may alter or completely destroy the 
10 integrity of the DNA. Thus, the coding region of the DNA 
may not be identical to that which was transfected. 

The novel method of gene activation and/or 
expression modification that we describe below cannot 
result in the production of mutant forms of the desired 
15 protein, since the coding region of the desired gene is not 
subjected to enzymatic modifications. 

In summary, a large amount of the DNA transfected 
into the cell using traditional techniques, and 
particularly the coding region thereof, will not be 
2 0 faithfully transcribed. It may be degraded prior to entry 
into the nucleus, enzymatically perturbed so that it will 
not encode the entire desired protein or it may not contain 
all of the necessary regulatory segments to allow for 
transcription. it may be inserted into a portion of the 
25 genome that prevents transcription. If the cDNA is 

transcribed, the protein of interest may not be produced 
efficiently due to the omission of introns which may 
contain enhancers or enable efficient mRNA processing. 
Finally, it may remain episomal, promote protein production 
30 but be unstable as the cell population grows through cell 
division. 

It would be most desirable to develop a method of 
induction of gene expression that would produce a cell line 
that has incorporated the positive attributes of the 
35 existing methods but somehow circumvents the unattractive 
features. It would further be desirable to be able to 
express or modify endogenous expression of particular genes 
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in the cell type of choice, it is further desired to be 
able to take advantage of the potential' benefits that may 
be afforded by a complete genomic sequence which may 
include cryptic transcriptional enhancers that may reside 
within introns, by appropriate placement of introns for 
proper nucleosome phasing or by more efficient mRNA 
processing events. These advantages are ordinarily not 
enjoyed in recombinant DNA expression methods due to the 
size of the gene. If one were able to express a gene that 
is already resident in the genome, i.e., an endogenous 
gene, cell line stability and expression rates would become 
more consistent and predictable. 

SUMMARY OF THR TNVENTTOU 

Accordingly, it is an object of the present 
invention to eliminate the above-noted deficiencies in the 
prior art. 

It is another object of the present invention to 
provide a method of regulation and/or amplification of gene 
expression that incorporates the positive attributes of 
recombinant gene technology but circumvents the 
unattractive features. 

It is a further object of the present invention 
to provide a method for expressing specific genes present 
but normally transcriptionally silent in a cell line of 
choice . 

It is yet a further object of the present 
invention to provide a method for expressing proteins which 
takes full advantage of complete genomic sequences that are 
responsible for mRNA accumulation and/or transcription. 

It is still another object of the present 
invention to provide a method of modifying the expression 
characteristics of a gene of interest by inserting DNA 
regulatory segments and/or amplifying segments into the 
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genome of a stable cell line or cloned microorganism 
upstream of, within, or otherwise proximal to the native 
gene of interest. 

It is still a further object of the present 
5 invention to provide a method for modifying the expression 
characteristics of a gene which is naturally present within 
the genome of a stable cell line or cloned microorganism 
and at the same time insert characteristics which will aid 
in the selection of cells which have been properly 
10 modified* 

It is yet another object of the present invention 
to provide a genome having therein, proximal to the coding 
region or exons of a gene of interest, a regulatory or 
amplifying segment which does not naturally appear 
15 thereat. 

It is another object of the present invention to 
provide DNA constructs which can be used for accomplishing 
the homologous recombination methods of the present 
invention. 

20 It is a further object of the present invention 

to provide cell lines and microorganisms which include the 
genomes in accordance with the present invention. 

These and other objects of the present invention 
are accomplished by means of the technique of homologous 

25 recombination, by which one of ordinary skill in this art 
can cause the expression and, preferably, amplification of 
resident, albeit transcriptionally silent genes. By this 
technique, one can also modify the expression 
characteristics of a gene which is naturally present, but 

*° not necessarily silent or inert, within the genome of a 
stable cell line, such as, for example, to make the 
expression conditional, i.e., reptessible or inducible, or 
to enhance the rate of expression. 

The present invention provides a method of 

[ 5 modifying the expression characteristics of a gene within 
the genome of a cell line or microorganism. A DNA 
construct is inserted into that genome by the technique of 
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homologous recombination. The construct includes a DNA 
regulatory segment capable of modifying the expression 
characteristics of any gene to which it is operatively 
linked within the host cell line or microorganism, as well 
as a targeting segment homologous to a region of the genome 
at which it is desired for the DNA regulatory segment to be 
inserted. The construct and insertion technique is 
designed to cause the new DNA regulatory segment to be 
operatively linked to the gene of interest. Thus, without 
necessarily inserting any new coding exons, the expression 
characteristics of that gene are modified. In the 
preferred embodiment, the gene is one which is normally 
transcriptionally silent or inert within the host cell line 
or microorganism and, by means of the DNA regulatory 
region, which is targeted directly to the appropriate 
position with respect to that gene by means of homologous 
recombination, that gene is thereby activated for 
expression of its gene product. 

The DNA construct preferably includes two 
targeting segments which, while separated from one another 
in the construct by those elements to be inserted into the 
genome, are preferably contiguous in the native gene. 

The construct further preferably includes at 
least one expressible selectable marker gene, such as the 
25 gene providing neomycin resistance. This marker gene 

including a promoter therefor, is also disposed between the 
two targeting regions of the construct. 

In another embodiment, the construct includes an 
expressible amplif iable gene in order to amplify expression 
of the gene of interest. This gene, including a promoter 
therefor, is also disposed between the two targeting 
regions of the construct. In some cases the selectable 
marker and the amplifiable marker may be the same. 

in a further embodiment of the present invention 
the DNA construct includes a negative selectable marker 
gene which is not expressed in cells in which the DNA 
construct is properly inserted. This negative selectable 



20 



30 



35 



WO 91/09955 



PCT/US90/07642 



marker gene is disposed outside of the two targeting 
regions so as to be removed when the construct is properly 
combined into the gene by homologous recombination. An 
example of such a negative selectable marker gene is the 
5 Herpes Simplex Virus thymidine kinase gene. 

In yet a further embodiment, it is possible to 
modify the expression characteristics of a specific gene 
which already expresses a product in the cell line or 
microorganism of interest. This can be accomplished by 

10 inserting by homologous recombination a DNA construct which 
includes (1) an expressible amplifiable gene which 
increases the copy number of the gene of interest when the 
cell line or microorganism is subjected to amplification 
conditions and/or (2) a promoter/ enhancer element (or other 

15 regulatory element) which modifies the expression of the 
gene of interest such as, for example, by increasing the 
rate of transcription , increasing translation efficiency, 
increasing mRNA accumulation, making the expression 
inducible, etc. The gene expression which is modified in 

2 0 this manner may be natural expression or expression which 
has been caused by previous genetic manipulation of the 
cell line or microorganism. The previous genetic 
manipulation may have been by conventional techniques or by 
means of homologous recombination in accordance with the 

25 present invention. In the latter case, the DNA insertion 
which results in the modification of expression 
characteristics may be accomplished as part of the same 
genetic manipulation which results in expression of the 
gene or may be performed as a subsequent step. 

30 The present invention also includes the 

constructs prepared in accordance with the above discussion 
as well as the genomes which have been properly subjected 
to homologous recombination by means of such constructs and 
the cell lines and microorganisms including these genomes. 

35 
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Moreover, a process for preparation of the desired product 
by culturing the transformed cells according to the present 
invention is also included. 

5 BRIEF DESCRIPTTOM OF TKTC pRAWTTTOS 

Fig. l shows a general outline of a DNA construct 
in accordance with the present invention. 

Fig. 2A shows the mode of integration of the DNA 
construct into the genome in tne event of non-homologous or 
10 random recombination. 

Fig. 2B shows the mode of integration of the DNA 
construct in the genome in the event of homologous 
recombination. 

Fig. 3 shows the construction of a preferred 
homologous recombination vector in accordance with the 
present invention. 

Fig. 4 shows the mode of integration of a 
circular piece of DNA by homologous recombination when only 
a sxngle targeting piece of DNA is employed. 

Fig. 5 shows the pRSVCAT plasmid, including the 
restriction sites thereof. 

Fig. 6 shows the construction of the pRSV 
Plasmid, including the restriction sites thereof. 

Fig " 7 shows the PSV2NE0" piasmid, including the 
25 restriction sites thereof. 

Fig. 8 shows the construction of the pSVNEOBAM 
plasmid, including the restriction sites thereof. 

Fig. 9 shows the construction of the pRSVNEO 
plasmid, including the restriction sites thereof. 

Fig. 10 shows the construction of the pRSVCATNEO 
plasmid, including the restriction sites thereof. 

Fig. 11 shows a 15.3 kb fragment of the rat TSHB 
gene and showing various restriction segments thereof. 

Fig. 12 shows the construction of the 
PRSVCATNEOTSHB3 plasmid, including the restriction sites 
thereof . 
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Fig. 13 shows the construction of the 
pRSVCATNEOTSHB3-5XbaI plasmid, including the restriction 
sites thereof. 

Fig. 14 shows a portion of the nucleotide 
5 sequence of TSHB along with the regions thereof to which 

each primer for PCR amplification corresponds. Exons 2 and 
3 are shown in capital letters. A 247 BP amplified 
fragment is shown by underlined asterisks. 

Fig. 15 shows the results of poly aery lamide gel 
10 electrophoresis of cDNA synthesized from RNA extracted from 
various cell populations and whose TSHB cDNA, if present, 
has been amplified by PCR. The nature of the cells 
representing the various lanes is set forth in Fig. 15 
below the gel. 

15 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Homologous recombination is a technique developed 
within the past few years for targeting genes to induce or 
correct mutations in transcriptionally active genes 

20 (Kucherlapati, Prog, in Nucl. Acid Res, and Mol. Biol. . 

36:301 (1989)). This technique of homologous recombination 
was developed as a method for introduction of specific 
mutations into specific regions of the mammalian genome 
(Thomas et al., Cell . 44:419-428, 1986; Thomas and" 

25 Capecchi, Cell . 51:503-512, 1987; Doetschman et al., Proc. 
Natl. Acad. Sci. , 85:8583-8587, 1988) or to correct 
specific mutations within defective genes (Doetschman et 
al., Nature, 330:576-578, 1987). 

Through this technique, a piece of DNA that one 

3 0 desires to be inserted into the genome can be directed to a 
specific region of the gene of interest by attaching it to 
"targeting DNA" . "Targeting DNA" is DNA that is 
complementary (homologous) to a region of the genomic DNA. 
If two homologous pieces of single stranded DNA (i.e., the 

35 targeting DNA and the genomic DNA) are in close proximity, 
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they will hybridize to form a double stranded helix. 
Attached to the targeting DNA is the DNA sequence that one 
desires to insert into the genome. 

There are a number of methods by which homologous 
5 recombination can occur, one example is during the process 
of replication of DNA during mitosis in cells. 

Through a mechanism that is not completely 
understood, parental double-stranded DNA is opened 
immediately prior to cell division at a local region called 
10 the replication bubble. The two separated strands of DNA 
may now serve as templates from which new strands of DNA 
are synthesized. One arm of the replication fork has the 
DNA code in the 5 • to 3 • direction, which is the 
appropriate orientation from which the enzyme DNA 
15 polymerase can "read". This enzyme attaches to the 5- 

portion of the single stranded DNA and using the strand as 
a template, begins to synthesize the complementary DNA 
strand. The other parental strand of DNA is encoded in the 
3' to 5- direction, it cannot be read in this direction by 
20 DNA polymerase. For this strand of DNA to replicate, a 
special mechanism must occur. 

A specialized enzyme, RNA primase, attaches 
itself to the 3- to 5. strand of DNA and synthesizes a 
short RNA primer at intervals along the strand. Using 
these RNA segments as primers, the DNA polymerase now 
attaches to the primed DNA and synthesizes a complementary 
pxece of DNA in the 5- to 3' direction. These pieces of 

newly synthesized DNA are called OkazalH m n ^ Tne 

RNA primers that were responsible for starting the entire 
reaction are removed by the exonuclease function of the DNA 
polymerase and replaced with DNA. This phenomenon 
continues until the polymerase reaches an unprimed stretch 
of DNA, where the local synthetic process stops. Thus 
although the complementary parental strand is synthesized 

35 overall in the 3* to 5' direction it- -i <= =,^- 

uirection, it is actually produced 
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by "backstitching" in the 5' to 3 • direction. Any nicks 
that might occur in the DNA during the "backstitching" 
process are sealed by an enzyme called DNA ligase. 

To maintain an absolute fidelity of the DNA code, 
5 a proofreading function is present within the DNA 

polymerase. The DNA polymerase requires primed pieces of 
DNA upon which to synthesize a new strand of DNA. As 
mentioned above, this can be a single strand of DNA primed 
with RNA, or a complementary strand of DNA. When the DNA 

10 polymerase finds mismatched complementary pieces of DNA, it 
can act as an exonuclease and remove DNA bases in a 3 » to 
5' direction until it reaches perfect matching again. 

With this background, it is now possible to 
understand the basis of the technique described herein. 

15 small pieces of targeting DNA that are complementary to a 
specific region of the genome are put in contact with the 
parental strand during the DNA replication process. It is 
a general property of DNA that has been inserted into a 
cell to hybridize and therefore recombine with other pieces 

20 of DNA through shared homologous regions. If this 

complementary strand is attached to an oligonucleotide that 
contains a mutation or a different sequence of DNA, it too 
is incorporated into the newly synthesized strand as a 
result of the recombination. As a result of the proof - 

25 reading function, it is possible for the new sequence of 

DNA to serve as the template. Thus, the transfected DNA is 
incorporated into the genome. 

If the sequence of a . particular gene is known, a 
piece of DNA that is complementary to a selected region of 

30 the gene can be synthesized or otherwise obtained, such as 
by appropriate restriction of the native DNA at specific 
recognition sites bounding the region of interest. This 
piece will act as a targeting device upon insertion into 
the cell and will hybridize to its homologous region within 

35 the genome. If this hybridization occurs during DNA 

replication, this piece of DNA, and any additional sequence 
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attached thereto, will act as an Okasajci fragment and will 
be backstitched into the newly synthesized daughter strand 
Of DNA. 

In the technique of the present invention, 
attached to these pieces of targeting DNA are regions of 
DNA that are known to interact with the nuclear regulatory 
proteins present within the cell and, optionally, 
amplifiable and selectable DNA markers. Thus, the 
expression of specific proteins may be achieved not by 
transf ection of DNA that encodes the gene itself and marker 
DNA, as is most common, but rather by the use of targeting 
- DNA (regions of homology with the endogenous gene of 
interest) coupled with DNA regulatory segments that provide 
the gene with recognizable signals for transcription, with 
thxs technology, it is possible to express and to amplify 
any cognate gene present within a cell type without 
actually transfecting that gene. in addition, the 
expression of this gene is controlled by the entire genomic 
DNA rather than portions of the gene or the cDNA, thus 
improving the rate of transcription and efficiency of mRNA 
processing. Furthermore, the expression characteristics of 
any cognate gene present within a cell type can be modified 
by appropriate insertion of DNA regulatory segments and 
wxthout inserting entire coding portions of the gene of 
interest. 

in accordance with these aspects of the instant 
invention there are provided new methods for expressing 
normally transcriptionally silent genes of interest, or for 
modifying the expression of endogenously expressing genes 
of interest, within a differentiated cell line The 
cognate genomic sequences that are desired to be expressed 
or to have their expression modified, will be provided with 
the necessary cell specific DNA sequences (regulatory 
and/or amplification segments) to direct or modify 
expression of the gene within the cell. The resulting DNA 
will comprise the DNA sequence coding for the desired 
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protein directly linked in an operative way to heterologous 
(for the cognate DNA sequence) regulatory and/ or 
amplification segments: A positive selectable marker is 
optionally included within the construction to facilitate 
5 the screening of resultant cells. The use of the neomycin 
resistance gene is preferred, although any selectable 
marker may be employed. Negative selectable markers may, 
optionally, also be employed. For instance, the Herpes 
Simplex Virus thymidine kinase (HSVtk) gene may be used as 
10 a marker to select against randomly integrated vector DNA. 
The fused DNAs, or existing expressing DNAs , can be 
amplified if the targeting DNA is linked to an amplif iable 
marker ♦ 

Therefore, in accordance with the method of the 

15 present invention, any gene which is normally expressed 
when present in its specific eukaryotic cell line, 
particularly a differentiated cell line, can be forced to 
expression in a cell line not specific for it wherein the 
gene is in a silent format. This occurs without actually 

2 0 inserting the full DNA sequence for that gene. In 

addition, that gene, or a normally expressing gene, can be 
amplified for enhanced expression rates. Furthermore, the 
expression characteristics of genes not totally 
transcriptionally silent can be modified as can the 

25 expression characteristics of genes in microorganisms. 

In one embodiment of the present invention, 
eukaryotic cells that contain but do not normally 
transcribe a specific gene of interest are induced to do so 
by the technique described herein. The homologous 

30 recombination vector described below is inserted into a 
clonal cell line and, following chemical selection, is 
monitored for production of a specific gene product by any 
appropriate means, such as, for example, by detection of 
raRNA transcribed from the newly activated gene, 

35 immunological detection of the specific gene product, or 
functional assay for the specific gene product. 
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The general outline of the DNA construct that is 
use, to transcriptionaily activate endogenous genes * 
homologous recombination is depicted in Figure 1 

In general, the DNA construct comprises at least 
5 two and up to six or more separate DNA segments. 



stents consist of at least one, preferably two, DNA 
targets .events ( A a„ d B) homologous t<J J J™ 
cell genome within or proximal to the gene desired to be 

« ^HoT '.'nT™ SelMti - — (O . an amplif lable 
re™, ! ' Election <T*ne (B) and a DNA 

Sr^TLTT T WWCh iS te —^«°-«y active in 

tne ceil to be transf ected. in th« ^ 

^ ne most basic embodiment- 

fB, a^r 8 "' inVentl ° n ' a single targeting segment 

IS i! ~ «g»ent (F) must be present, ^of 

« the other regions are optional and produce preferred 
constructs. ^ Jrjrea 

Regions A and B are DNA sequences which are 

- regions A and B of 

to be upstream and downstream ^ 

srream ' respectively, of the 
specific position at which it is desired for th , 
segment to be inserted. Although ZslVZ^lT^ 

>5 171T*T ^ C ° nStrUCt ^ ^ ««2hay contiguous 
-=> xn the endogenous gene. Therp ™ a „ ^ 9 

contiguous portions* of thl^ 'are Z^tZMT T 
segments, for example, where it is desired " ^ e^T " 
portion of the genome, such as a nor^ aeie *e a 

element. negative regulatory 

^ „ WhilS tW ° tar ^ing regions, A and B, are 
preferred in order to increase the total regions of 
homology and thus increase recombination efficiency the 
Process of the present invention also compress L use 
of only a single targeting region t« i+- • 
> 0*. cm, the regulator,^-, andte^LT 
marker gene c and promoter c are to be insert^ ! 
circular piece of dna is employed which -ntals^ese 
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elements along with the targeting DNA (see Figure 4). In 
this way, the homologous region (B) hybridizes with its 
genomic counterpart. Segments C , C and F are inserted 
within the B portion of the cognate gene following the 

5 crossover event. 

When it is desired for the DNA regulatory 
sequence to be inserted upstream of the gene of interest, 
as, for example, when it is desired to activate and express 
a normally transcriptionally silent gene, the region of 

10 homology is preferably homologous to a non-coding portion 
of the genome upstream of the coding portions of the gene 
of interest. When two targeting regions are present, the 
downstream region (A) may include a portion of the coding 
region, although it is preferred that it, too, be totally 

15 upstream of the coding region. It is further preferred 
that the homologous regions be chosen such that the DNA 
regulatory sequence will be inserted downstream of the 
native promoter for the gene of interest, particularly if 
the native promoter is a negative promoter rather than a 

20 turned-off positive promoter. 

The size of the targeting regions, i.e., the 
regions of homology, is not critical, although the shorter 
the regions the less likely that they will find the 
appropriate regions of homology and recombine at the 

25 desired spot. Thus, the shorter the regions of homology, 
the less efficient is the homologous recombination, i.e., 
the smaller the percentage of successfully recombined 
clones. It has been suggested that the minimum requirement 
for sequence homology is 25 base pairs (Ayares et al, PNAS, 

30 83:5199-5203, 1986). Furthermore, if any of the other 

elements of the construct are also found in the genome of 
the host cell, there is a possibility of recombination at 
the wrong place. However, in view of the excellent 
positive and negative selectability of the present 

35 invention, it can be successfully practiced even if the 

efficiency is low. The optimum results are achieved when 
the total region of homology, including both targeting 
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regions, is large, tor example one to three kii oba 
long as the regulatable segment f can kllobases ' As 

to the gene of interest ^ s „ • °peratively linked 
tne targeting region, J^JL^ ^ T ^ ° f 
5 targeting region B. ~«*arly the upstream 

regulable segment P spaced too far f "T t Z Tl 
«* the gene to be operatively link J f C ° d "' 9 re9i ° n 

« case, the regions A ane* r thereto, m such a 

.ec'tLn : f te t bcsoi °^ to * 

repeated until the regulaSbl, "°* the Pr ° cess 

Verted so as to be T^ll^t^*^ 
interest. For ev»TnrM~ *v ^ to the We of 

« region A-B o^tnT^enT ^ «* -*«d 

Process repeated" ^ ^ ~ 

invention is known, along with „ 
herein, one of ordinary sJii ^th dlsClosed 

and use the Pr^LZllZTll^ * <° 

" given gene of interest in ntl ° n Wlth respect to any 

without us. of unlTe t ^L e t 1 L line * 

^i=h is Chi" o'f^dr 81 "" SelSCtable — 
" resistant to TnorlllTt"" 9 «" "~ 

« -h genes are ^IVlZlZt"^ ^ 
Phosphotransferase fneo) ! amlno 9 1 ycoside 

hygro^ycin-B-phosphotalsfertTZrCT^ ' 
«*), xanthine-ouanin. . th ? mid ">e kinase 

multiple drug re^iXcT ! bOSyltrMSfe " Se <«*> - 

0 decarboxylase^ "dc, H ' 

resistance (CAX>) . B - (ph ° s P>>°"a=etyl>- L -as P artate 

an CSSr^.* ^ . — - 

the construct at *JL". ^Tr™* *» 
that lead to an increase in Pllflable 9 enes are genes 
selective pressure ^! c "*« WhS " — « 

adjacent to the ^uSLTI ™ 3 

a"PHfiable gene will also increase. 



910995.SA1 J_ > 



WO 91/09955 



PCI7US90/07642 



- 19 - 



Amplifiable genes that can be utilized include DHFR, MDR, 
ODC, ADA and CAD. The members of the positive selectable 
marker- gene group and those of the amplifiable gene group 
overlap so that, in theory, instead of using two genes, one 
5 for positive selection and one for amplification, one gene 
could be used for both purposes. However, since most cell 
lines contain endogenous copies of these amplifiable genes, 
the cells will already be somewhat resistant to the 
selection conditions and distinguishing the cells which 

10 have transfected DNA from those which do not receive 

transfected DNA can be difficult. Thus, in instances where 
an amplifiable gene is desired, a positive selection gene 
which is dominant, such as HPH, gpt, neo and tk (in tk- 
cells) , should also be included in the construct. For some 

15 applications it may be possible or preferable to omit the 

amplifiable marker. For instance, the gene of interest may 
not need to be amplified as, for example, when 
transcriptional activation by the heterologous DNA 
regulatory sequence is sufficient without amplification. 

20 Also, if the homologous recombination efficiency is very 

low, it may be necessary to leave out the amplifiable gene 
since the ratio of non-homologous DNA to homologous DNA is 
directly related to the homologous recombination efficiency 
(Letsou, Genetics . 117:759-770," 1987). It is also possible 

25 to eliminate the positive selection gene and select cells 
solely by screening for the production of the desired 
protein or mRNA. However, it is preferred in most cases to 
include at least the positive selection gene. 

Region E of the construct is a negative 

30 selectable marker gene. Such a gene is not expressed in 
cells in which the DNA construct is properly inserted by 
homologous recombination, but is expressed in cells in 
which the DNA construct is inserted improperly, such as by 
random integration. One such gene is the Herpes Simplex 

35 virus thymidine kinase gene (HSVtk) . The HSVtk has a lower 
stringency for nucleotides and is able to phosphorylate 



PCT/US90/07642 



- 20 - 



10 



15 



nucleotide analogs that normal mammalian cells are unable 
to phosphorylate. It the HSVtk is present in the cells 
nucleotide analogs such as acyclovir and gancyolovir are 
Phosphorylated and incorporated into the DHA or the host 

Zll^ killin9 csl1 - The presence ° f *»» 

selectable marker gene enables one to use the positive- 
negative selection for homologous recombination as 
described by Mansour et al (Harare., 336=348-352, l 988 ) 
capecchi uses a strategy which takes advantage of the ' 
differing modes of integration that occur when linearized 

ITT t tS ^ h °*° X °*°™ recombination as compared 

to when it mserts by random integration. If the vector 
DHA inserts randomly, the majority of the inserts will 
insert via the ends (Polger et al, Mol. en 
2 = 1372-1387, 1982; Roth et al, Mol. on 5: ' 2599 _ 
2607 1 9 8 5 ; and Thomas et al, au, 44 = 4X9-428, . ^ 

the other hand, if the vector inserts by homologous 
recombination, it will recombine through the regions of 
homology which cause the loss of seguences outside of those 

using the construct depicted in Figure 1 as an 
example, the mode of integration for homologous 
recombination versus random integration is illustrated in 
Figures 2A and ». In the case of non-homologous 

^r b i n !! i ° n (Fl9UrS 2A> ' ^ VSCt0r ls lnsert <* via the 
2t zlLT C ° nStrUOt - «9*°n «. in this case 

^e HSVt* gene, to be inserted into the genome. However, 
when homologous recombination occurs (Figure 2B) , the HSVtk 
gene is lost. The first round of selection uses the 
appropriate drug or conditions for the positive selection 
present within the construct. Cells which have DHA 
integrated either by homologous recombination or random 
integration will survive this round of selection. The 
surviving cells are then exposed to a drug such as 

HSVtk gene, in this case, most of the cells in which the 
vector integrated via a random insertion contain the H^tk 
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gene and are killed by the drug while those in which the 
vector integrated by homologous recombination have lost the 
HSVtk gene and survive. This allows the elimination of 
most of the cells which contain randomly integrated DNA, 
5 leaving the majority of the surviving cells containing DNA 
which integrated via homologous recombination* This 
greatly facilitates identification of the correct 
recombination event . 

The negative selection step can also be 

10 eliminated if necessary. It will require that the 

screening step be more labor intensive involving the need 
for techniques such as polymerase chain reaction (PGR) or 
immunological screening. 

The sixth region (F) contains the DNA regulatory 

15 segment that will be used to make the gene of interest 

transcriptionally active. The appropriate DNA regulatory 
segment is selected depending upon the cell type to be 
used. The regulatory segment preferably used is one which 
is known to promote expression of a given gene in 

20 differentiated host cell line. For example, if the host 
cell line consists of pituitary cells which naturally 
express proteins such as growth hormone and prolactin, the 
promoter for either of these genes can be used as DNA 
regulatory element F. When inserted in accordance with the 

25 present invention, the regulatory segment will be 

operatively linked to the normally transcriptionally silent 
gene of interest and will stimulate the transcription 
and/or expression of that gene in the host cell line. Also 
usable are promiscuous DNA regulatory segments that work 

30 across cell types, such as the rous sarcoma virus (RSV) 
promoter. As long as the regulatory segment stimulates 
transcription and/or expression, or can be induced to 
stimulate transcription and/or expression, of the gene of 
interest after being inserted into the host cell line so as 

35 to be operatively linked to the gene of interest by means 
of the present invention, it can be used in the present 
invention. It is important when joining the regulatory 
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segment F ^ the targeting segment A that no starting codon 
be accidentally introduced into the seguence since such an 
occurrence could alter the reading frame of the gene which 
is desired to be expressed. Of course, the construct must 
be constructed and inserted such that the regulatory 
segment F is operatively linked to the gene of interest 

D re Se n* • ^ T A regUlat ° ry Se ** ent ' region F, need not be 
present in instances where it is desired to enhance or 
amplify the transcription of a gene which is already 
expressing in the cell line of interest, either because it 
naturally expresses in that cell line or because the cell 
line has previously had its DNA manipulated to cause such 

ZT^L T SUC ^ inStanC6S ' inserti - of an amplifiable 
gene, region D, preferably with the positive selectable 

ZTl TT' regi ° n C ' ^ ° ptionallv «l-o with a negative 
selectable marker gene, region E, will be sufficient to 
xncrease the copy number of the gene of interest and thus 
enhance the overall amount of transcription 
Alternatively, a new regulatory segment, region F, 
inherently promoting an increased (or otherwise modified) 
rate of transcription as compared to the existing 
regulatory region for the gene of interest, may be included 
to further enhance the transcription of the existing 

2* ZlTl^lT ^ f tereSt - SUCh 3 ^ -gment 
could include promoters or enhancers which improve 

transcription efficiency. 

are used /T?" C *' D ' — «' ■» Promoter regions which 
are used to drive the genes in regions c, D, and E 

30 T^ZT^TT ^ t™^-^ active 

fLr! h ! T " ay be sane or different 

llZ T T * t0 **~ «» endogenous 

Scif , ^ SPeC " 1C dlreCtim of transcription 

Tth n *; 1 is not critical - «* o^i 

skill „ this art can determine any appropriate placement 
of the genes c. D and E and their promoters c , „. and 
such that the promoters win stimulate expression of their 
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associated genes without simultaneously disrupting in any 
way the expression of the gene of interest or any of the 
other genes of the construct. 

The present invention may be illustrated by 
5 reference to the activation of the rat thyrotropin beta 

subunit (TSHB) in GHj (ATCC CCL 82) , GH 3 (ATCC CCL 82.1) or 
GH A Cl cell lines (GH) . GH cell lines are derived from a 
radiation induced pituitary tumor in rats designated MtT/W5 
(Takemoto, Cancer Res* . 22:917, 1962) and adapted to grow 

10 in culture by Tashjian et al. Endocrinology , 82:342-352, 
1968. These cell lines may be subcloned and screened for 
their ability to produce growth hormone and TSHB. Such 
screening may preferably be by means of Northern blot 
analysis to determine whether mRNA for the rat growth 

15 hormone gene is present and to establish that there is no 

mRNA for the TSH6 gene being produced. The cell lines may 
also be screened by Southern analysis to determine that 
there is at least one copy of the TSHB gene present within 
the genome. Only the GH cell lines that produce growth 

20 hormone and not TSHB, but contain a copy of the TSHB gene, 
are used. 

The specific homologous recombination vector for 
use in GH cells may be designed in the following manner 
(Figure 3). Reg'ion A may consist of the 5' upstream 

25 untranslated region of the TSHB gene defined by the Hindlll 
fragment which stretches from -74 to -2785 and region B may 
contain the DNA fragment that stretches from the -2785 
Hindlll site to a Ncol site approximately 2 . 1 kb further 
upstream as described by Carr et al ( J. Biol. Chem. , 

30 262:981-987 # 1987) and Croyle et al ( DNA , 5:299-304, 1986). 
The positive selection gene (region C) may be a 1067 bp 
Bglll-Smal fragment derived from the plasmid pSV2neo (ATCC 
No. 37,149) (Southern et al, J. Mol. Appl. Gen. . 1:327- 
341, 1982). The neo gene may be driven by the Rous 

35 Sarcoma Virus (RSV) promoter (region G 1 ) which is derived 
from the Ndel-Hindlll fragment from the plasmid pRSVcat 
(ATCC No. 37,152) (Gorman et al, PNAS, 79:6777-6781, 



NSDOCID' <WO 9109955A1_I_> 



WO 91/09955 



PCT/US90/07642 



- 24 - 



10 



15 



20 



25 



30 



35: 



1982) . In this example, no amplifiable marker need be 
used and thus there need be no region D in order to 
optimize the efficiency of the homologous recombination. 
The efficiency is inversely related to the proportion of 
non-homologous to homologous sequences present in the 
construct (Letsou et al, Genetics . 117:759-770, 1987). 
Region E, or the negative selection gene, may consist of 
the HSVtk gene which is a 2 kb Xho fragment obtained from 
the plasmid pMCITK plasmid (Capecchi et al. Nature . 
336:348-352, 1988). The HSVtk gene in that construct may 
be driven by the polyoma virus promoter and enhancer 
(region E«) as constructed by Thomas et al ( Cell . 51:503- 
512, 1987). In a second DNA construct the polyoma promoter 
may be replaced by the RSV promoter described above. The 
DNA regulatory sequence used to activate the TSHB gene may 
be either the RSV promoter or the rat growth hormone 
promoter. The rat growth hormone promoter consists of the 
SacI-EcoRI fragment obtained from the plasmid P RGH237CAT 
(Larson et al, pnas, 83:8283-8287, 1986). The RSV promoter 
has the advantage of being usable in other cell lines 
besides GH cells, while the GH promoter is known to be 
active in GH cells and can be specifically induced (Brent 
et al, J. Biol. Chem. , 264:178- 182, 1989). The rat growth 
hormone promoter and the RSV promoter may be inserted at 
location F in separate constructs. 

Following transfection of the above construct 
into a GH cell line, the cells may be grown in media that 
contains G418. This will allow only those cells which have 
integrated plasmid DNA into the genome either by homologous 
recombination or random integration to grow. The surviving 
cells may be grown in media that contains gancyclovir. The 
majority of the cells that survive this round of selection 
will be those in which the vector plasmid DNA is integrated 
via homologous recombination. These cells may be screened 
to demonstrate that they are producing mRNA which 
corresponds to the TSHB gene and that they are producing 
the TSHB protein. The genomic DNA may also be sequenced 
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around the area of insertion of the heterologous promoter 
to insure that the proper recombination event occurred. 

EXAMPLE - Activation of TSHfi Gene in Rat Pituitary Cells 
5 Using the following protocol , thyrotropin beta 

subunit (TSH6) gene transcription, which normally does not 
occur ih the rat GH 3 pituitary cell line, was activated in 
those cells by using the process of homologous 
recombination to target an activating element upstream of 
10 the TSHB coding region. The Rous Sarcoma Virus (RSV) 
promoter is known to function efficiently in GH3 cells 
(Christian Nelson et al, Nature . 322:557-562 (1986); Zheng- 
Sheng Ye et al, The Journal of Biological Chemistry , 
263:7821-7829 (1988)) and therefore was chosen as the 
15 activating element. A plasmid vector was constructed which 
contained the RSV activating element, portions of the 5 1 
flanking region of the TSHB gene locus, and a selectable 
drug marker, aminoglycoside phosphotransferase gene (NEO) , 
for the isolation of transfected cell populations. 
20 Ribonucleic acid (RNA) was extracted from pooled drug 
resistant GH 3 cell populations and converted to 
complementary deoxyribonucleic acid (cDNA) . The cDNA was 
then screened by the technique of polymerase chain reaction 
(PCR) for the presence of TSHS cDNA. The constuction of 
25 the homologous recombination vectors and the control 
vectors is outlined below along with the experimental 
procedures and results. 

PLASMID CONSTRUCTION 
30 Homologous Recombination f HRV Backbone Vector 

( r>RS VCATNEO ) . 

The Rous Sarcoma Virus (RSV) promoter was derived 

from the plasmid pRSVCAT (Cornelia M. Gorman et al., 

Proceedings of the National Academy of Science . 79:6777- 
35 6781 (1982)) (figure 5) by isolating the 580 base pair (bp) 

Ndel - Hindlll fragment containing the functional promoter 
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unit. The ends of this fragment were blunted using DNA 
polymerase I Klenow fragment and Xbal linkers ligated to 
the blunt ends. After digestion with Xbal restriction 
endonuclease and gel purification, the resulting fragment 
was ligated into the Xbal site of pUC18 . A bacterial 
colony harboring a plasmid with the RSV insert in the 
orientation shown in figure 6 was designated pRSV. The 
aminoglycoside phosphotransferase gene (NEO) was cloned 
from PSV2NEO (P.J. Southern et al. , Journal of M m a .n 1a , 
and Applied Genetics , 1:327-341 (1982)) by isolating the 
Bglll and BamHI fragment (figure 7) and ligating that 
fragment into the BamHI site of pRSV (figure 6),.. A plasmid 
containing the NEO gene in the orientation shown in 
figure 8 was picked and designated pRS VNEOBAM . pRSVNEOBAM 
was digested with Smal and the 4328 bp fragment containing 
the RSV promoter region, the majority of the NEO gene and 
PUC18 was isolated by gel electrophoresis. The Smal ends 
of this fragment were Xhol linkered, cleaved with Xhol 
restriction enzyme and the plasmid recircularized by 
20 ligation. The resulting plasmid is shown in figure 9 and 
is called pRSVNEO. This last cloning step resulted in the 
deletion of a 786 bp fragment from the 3- end of the NEO 
fragment which is not necessary for its functional 
expression. This construction yields a plasmid in which 
the NEO gene is transcriptionally driven by the RSV 
promoter. 
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Next the Ndel site located 5' of the RSV promoter 
in pRSVCAT (figure 5) was converted to a Sail site. This 
was accomplished by digesting pRSVCAT with Ndel, filling in 
the ends using DNA polymerase I Klenow fragment and 
ligating Sail linkers to the resulting blunt ends. The 
linkers were digested to completion with sail and the 
plasmid recircularized by ligation. Into the newly 
constructed Sail site was cloned the Sail - Xhol fragment 
from pRSVNEO (figure 9) containing the RSV promoter and the 
NEO. gene. A plasmid with the RSV promoter and NEO fragment 
onented as shown in figure 10 was isolated and named 



JSDOCID: <WO. 



9109955A1 I > 



WO 91/09955 



PCT/US90/07642 



- 27 - 

pRSVCATNEO. This plasmid when transfected into GH 3 cells 
was capable of conferring G418 resistance to those cells, 
demonstrating the ability of the RSV promoter to drive 
transcription of the NEO gene and the ability of that RNA 
5 to be translated into a functional protein (data not 

shown) . Total RNA from the stable trans fectants above was 
analyzed by polymerase chain reaction (PCR) to determine 
whether the CAT gene was being transcribed. PCR results 
showed that the CAT gene was indeed being transcribed in 

10 all the G418 resistant colonies tested (data not shown), 
indicating that the RSV promoter 5 1 of the CAT gene was 
capable of driving transcription of a gene located 3 1 to 
it. This is important because this RSV promoter will be 
responsible for driving transcription of the TSHB gene when 

15 the TSHB HR vector described below integrates via 
homologous recombination into the GH 3 genome. 

TSHB HR Vector 

A vector capable of integrating into the GH 3 
20 genome by homologous recombination was created by inserting 
two stretches of the 5 1 flanking regions of the thyrotropin 
beta subunit (TSHB) gene into the unique Sail and Hindlll 
sites contained in pRSVCATNEO (figure 10). . A rat spleen 
genomic library containing inserts of 15 kilobases (kb) or 
25 greater cloned into lambda DASH was obtained from 

Stratagene, San Diego, CA. Using standard protocols 
(Current Protocols in Molecular Biology , pp. 1.9.1 - 1.13.6, 
6.1.1 - 6.4.10) a 15.3 kb clone of the rat genomic TSHB 
gene including 9kb of sequence 5 1 of the first exon was 
( 0 isolated. The 15.3 kb fragment consisted of two Xbal 

fragments, a 10.6 kb fragment corresponding to the 5 1 end 
of the 15.3 kb fragment and a 4.7 kb piece corresponding to 
the 3» region of the 15.3 kb fragment (figure 11). Both of 
these Xbal fragments were subcloned into pUC18 and plasmids 
5 containing inserts in both orientations were isolated. The 
2.3 kb Xbal - Hindlll fragment contained in the 4.7 kb Xbal 
fragment (figure 11) was purified and the Xbal site of this 
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fragment was converted to a Hindlli site by filllng in tne 

conL T int ° thS Hind *" ^te 

contaxned xn pRSVCATNEO (figure 10) . An isolate 

corresponding to a plasmid with the 2.3 Kb insert in the 
correct orientation as shown in fic™™ n , 

name pRSVCATNEOTSHB3 . ^ " ™ &SSi ^ ed **• 

TSHB clone T ^. SUbCl ° ned ^ f ">* th. rat 

Cl ° ne (f lgUre isolated and the Xbal ends 

converted to Sail sites by bi unt ending ^ 

DNA polymerase I Klenow fragment and attaching Z 1X 
linkers. This 10 6 irh q=>i-t * 

the sail sit! ! * fragment was then cloned into 

the Sail sxte of pRS VCATNEOTSHB 3 ffioure 121 a 
containing the insert in 12) . a plasmid 

identic," J ! correct orientation was 

xdentxfxed and named pRSVCATNE0TSHB3-5Xbai (figure 13, 

deo!sit C ° lleCtl0n ' R °<*ville, MD, and has received 
depository number ATCC 40933 p- 4-», 

deposit hS P ur P° s ® of this 

! ^ renamSd PHRTSH ' de PO S it was 

made in accordance with all <-v. . was 

Budapest Treaty. **" «»*"«—*« "t the 
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whi c * T' • 06118 3 SUbcloned Population of Mt T/ W5 

winch was derived -from =» /WD 
in rate yb * !Y Nation inauoed p it uit ary tumor 

in rate (B.K. Takeaoto, Canes,- 22:91? * r 

^ocT'T *° ^ ln " lt ~ «* *^*» « al" 
^doen nolnq y , 82 = 342-352 ,«„, . The ^ u ^ 

**«-d fro. the American Type culture C^ection^H 

ZZZ ZITT?'* ln oulture by in ---- 

nooiiiea Eagle's Medxum (DMEM) + iss- h„^ e « „ 
2.5% ratal bovine eerum (PBS) + 1% l TuT + 
at 37-c in 5% CO, . Wlutamme ( GHj media) 



35 



MSDOCIO <WO 91099S5A1_I_> 



WO 91/09955 



PCT/US90/07642 



DNA PREPARATION 

Large-Sc ale Preparation of Plas mid DNA 

All plasmids used for stable transf ections were 
purified by using the alkaline lysis method for large-scale 
5 plasmid DNA purification as described in Current Protocols 
in Molecular Biology , vol. 1, pp. 1.7.1 - 1.7.2. DNA 
isolated by the alkaline lysis method was further purified 
by double banding in a cesium chloride gradient as also 
described in Current Protocols in Molecular Biology , vol. 
10 1, pp. 1.7.5-1.7.7. 

Prior to transf ect ion, the HR vectors were 
digested with either Aatll or Apal. Apal was used to 
linearize the control plasmid pRSVCATNEO and Aatll to 
linearize the HR plasmid pRSVCATNEOTSHB3-5XbaI. The 
15 location of the cleavage sites of Apal and Aatll can be 
seen in figures 10 and 13 respectively. After digestion 
with the appropriate restriction enzyme, the reaction was 
phenol /chlor of orm extracted, chloroform extracted, ethanol 
precipitated, and washed once with 70% ethanol. The 
20 plasmids were then resuspended in sterile deionized water 

(dH 2 6) to a concentration of 1 microgram/microliter (Mg/Ml) 
as determined by absorbance at OD 260 . In an attempt to 
increase the transfection efficiency and/or the ratio of 
homologous recombination positives to those that" were due 
25 to random integration, pRSVCATNEOTSHB3-5XbaI was digested 
with Apal. Digestion with Apal cuts at three separate 
sites in pRSVCATNEOTSHB3-5XbaI and removes all regions of 
the vector except those necessary for homologous 
recombination (figure 13), After digestion with Apal, the 
3 0 reaction was electrophoresed on a 0.8% agarose gel and the 
top band corresponding to the 10,992 bp fragment containing 
the two 5' flanking regions of the TSHB gene, the RSV 
promoter - NEO region and the TSHB gerie-activating RSV 
promoter was isolated from the gel by electroelution into 
35 dialysis tubing. The electroeluted DNA was further 

purified by using an elutip minicolumn (Schleicher and 
Schuell) with the manufacturer's recommended standard 
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protocol. The DNA was eluted from the column, ethanol 
precipitated, washed with 70% ethanol and resuspended to a 
concentration of i fig/iii. 

5 STABLE TRANSFECTIONS 

Calcium Phosphat e Transf P rf^ w 

48 hours prior to transection 3 x io« GH 3 cells 
were plated on 10 centimeter (cm) dishes. For each dish 
10 „g of vector DNA along with 30 » g of sonicated salmon' 
sperm DNA was added to 0.5 milliliters (ml) of tranfection 
buffer The transfection buffer was prepared by combining 
4gNaCl, 0.l85gKcl, o.05g Na 2 HP0 4 , 0.5g dextrose, 2 .5g 
HEPES and dH 2 0 to a final volume of 500 ml and bringing the 
PH to 7.5. 31 M l of 2 molar (M) CaCl 2 was added to the 0.5 
ml of DNA + transfection buffer and vortexed. This 
solution was allowed to stand at room temperature for 45 
m^utes. when the DNA - CaCl 2 - transfection buffer was 
ready, the GH 3 medium was removed from the GH 3 cells and 
. the DNA - caci 2 - transfection buffer was layered over the 
fo 6 r I; Th V ellS ^ t0 ^ at «~ temperature 

added and the plates were incubated at 37 -c for 6 hours 
and then Sh ° Cked bY W 1 ™*** off the mediu* 

" ^eroTfor T^-T *»«« containing „ % 

PBS and fed with 10 ml of GH 3 medium. 48 hours post- 
transfection, the medium was removed and 10 ml of GH, 
medium containing 400 M g/ml G418 was added. 

30 ElectT-np oratinn 

Electroporation was carried out using a BTX 3 00 
Transactor with 3.5 millimeter (mm) gap electrodes. i x 
10 GH 3 cells growing in log phase were removed from their 
35 Y tryPSini2ati °*' P^t- by centrifugatiln ^ 

PBS and°t° e T PBS ' ^ 1. 0 ^ of 

PBS and transferred to 2.9 ml Ultra-UV disposable cuvettes 
(American Scientific Products) on ice. 10 „ of ^s 
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added to the cells, mixed and placed back on ice for 5 
minutes. After 5 minutes the electrodes were placed in the 
chamber and the cells were electroporated at a setting of 
750 microfarads with a 200 volt pulse. The cuvette was 
5 then returned to ice for 10 minutes. Cells were 
transferred from the cuvette to 9 ml of GH 3 medium 
containing 1% penicillin and 1% streptomycin at room 
temperature in a 15 ml conical tube and allowed to stand 
for 10 minutes. The total electroporation of 1 x 10 7 cells 
10 was transferred to three 10 cm plates giving approximately 
3 x 10 6 cells per plate. After 48 hours, the GH 3 medium 
containing 4 00 fig /ml G418 was added. 
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Transfection of GH 3 cells with pRSVCATNEOTSHB3-5Xbal (Aatll 
cut) . pRSVCATNEOTSHB3-5XbaI (Apal cut) and PRSVCATNEO (Anal 
cut) 

pRSVCATNEOTSHB3-5XbaI (Aatll cut) , 
pRSVCATNE0TSHB3^5XbaI (Apal cut) and pRSVCATNEO (Apal cut) 
plasmids were transfected into GH 3 cells along with a no 
DNA control using both the calcium phosphate protocol and 
the electroporation protocol. 48 hours after transfection, 
the cells were put under G418 selection. Approximately 14 
to 21 days later the colonies became visible by eye on the 
10 cm dishes and were counted. In all of the no DNA 
controls, there were no visible colonies, demonstrating 
that the G418 selection was working and that the presence 
of a plasmid containing the RSV - NEO region was necessary 
to confer G418 resistance. At this time, colonies were 
picked and pooled by isolating regions on the 10 cm dish 
with 17 millimeter wide cloning rings. These large cloning 
rings encompassed between 10 and 70 colonies depending on 
the density of the colonies per plate and allowed the GH 3 
cells in that isolated region to be removed and pooled at 
the same time by trypsinatibn. The trypsinized colonies in 
each ring were transferred to 6 well plates and allowed to 
grow in GH 3 media containing G418. After reaching 70% to 
80% confluence, 80,000 cells were transferred to a 24 well 
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Plate and the remaining cells cryopreserved for further 
testing at a later date. The cells in" the 24 well plates 
were grown until they reached 50% to 80% confluence. Total 
RNA was then harvested from these GH 3 cells by the 
following procedure. 

RNA ISOLATION FROM TRANSFECTED GH 3 CELLS GROWN IN 24 WELL 
PLATES 

The following is a modification of the protocol 
described by Chomczynski and Sacchi, Anal. Rin^ 
162:156-159 (1987) . The media covering the GH 3 cells in 
the 24 well plates was removed and the cells washed with 1 
ml of pes. i ml of GTC solution was added and the cells 
were incubated at room temperature for 5 minutes. GTC 
solution was prepared by dissolving 250 g of guanidium 
thxocyanate (Fluka) in 293 ml of dH z O, and then adding 
17.6 ml of 0.75 M Na citrate p H 7.0 and 26.4 ml of io% 
sarcosyl (L-Lauryl sarcosine, . Just prior to use, 360 M i 
of B-mercaptoethanol per 50 ml GTC solution was added 
After 5 mxnutes at room temperature, the 1 ml of GTC-cell 
lysate was transferred to a Sarstedt 55.518 snap-cap tube 
contaxnxng 2 ml of GTC solution. To each tube was added 
300 ^1 of 2M sodium acetate pH 4.0 and the tube vortexed. 
Next, 3 ml of dH 2 0 saturated phenol was added and the tubes 
were vortexed again. To each tube was added 600 M l of 
chloroform: isoamyl alcohol (49:1) and the tube was shaken 
by hand for 10 seconds and placed on ice for 15 minutes. 
The tubes were then centrifuge* in a Sorval RC-5B using a 
SM24 rotor at 8000 revolutions per minute (RP M) for 20 
mxnutes at 4-C. The aqueous phase was transferred to a 
fresh Sarstedt tube containing 3 ml of isopropanol and 
Placed at -20-C for 1 hour. After l hour the tubes were 
spun xn a Sorval RC-5B using a SM24 rotor at 8000 rpm for 
20 mxnutes at 4-0. The supernatants were removed and the 
pellets resuspended in 500 „1 of GTC solution. The 
resuspended RNA was transferred to a 1.5 ml eppendorf tube 
to whxch 500 „i of isopropanol was added. The tubes were 
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once again placed at -20 C C for 1 hour. The eppendorf tubes 
were spun for 5 minutes in a microfuge and the supernatant 
discarded. The pellet was washed with 70% ethanol 2 times 
and allowed to dry until the ethanol had completely 
5 evaporated. The pellet was resuspended in 20 fil of diethyl 
pyrocarbonate (depc) treated water and heated to 65 °C for 5 
minutes. This RNA was then used to make cDNA in one of the 
two procedures described below. 

cDNA REACTIONS 
Method 1 

First strand cDNA was synthesized from 2.5-6.0 
microliters of total RNA (approximately 0.5-6 micrograms) 
in a reaction volume of 10-20 microliters. The total RNA 
was obtained by the extraction method described above, and 
was denatured for 5-10 minutes at 70 °C and quick chilled on 
ice before adding the reaction components. The reaction 
conditions were 50 millimolar (mM) Tris-HCl (pH 8.3), 10 mM 
MgCl 2 , 10 mM DTT, 0.5 mM each of dCTP, dATP, dGTP, and dTTP 
(Pharmacia), 40 mM KC1, 500 units/ml RNasin (Promega 
Biotech), 85 fxg/ml oligo(dT) , 2 . n 8 (Collaborative Research, 
Inc.), and 15,000-20,000 units/ml Moloney murine leukemia 
virus reverse transcriptase (Bethesda Research 
Laboratories) incubated at 37 °C for 60 minutes. The 
reaction was terminated by the addition of EDTA to 4 0 mM, 
and the nucleic acid was precipitated by adding sodium 
acetate to a concentration of 0,3 M and two volumes of 
ethanol. The precipitate was; allowed to form at 0°C for 3 0 
minutes and was pelleted by centrif ugation in a microfuge 
at 14,000 rpm for thirty minutes. The pellet was washed 
with 70% ethanol, dried, and resuspended in depc treated 
water to a volume of 15-25 microliters. 

Method 2 

Conditions for first strand synthesis of cDNA 
from RNA were adapted from Carol A. Brenner et al, 
BioTechnicrues. Vol. 7, No. 10, pp. 1096-1103 (1989). 1 /xl 
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of total RNA from the RNA prep procedure described above 
was added to 9 M l of reaction buffer in a 0.5 ml eppendorf 
tube. The reaction buffer consisted of 200 units of 
Moloney murine leukemia virus reverse transcriptase (MMLVRT 
5 Bethesda Resesarch Labs) , and a final concentration of the 
following reagents: 70 mM Tris.HCl pH 8.8, 40 mM KC1, 0.1% 
Triton X-100, 1 mM of each dNTP, 4 mM MgCl 2 , and 0.45 OD 26 ' 0 
units of random hexamers (Pharmacia) . After mixing, the 
tubes were incubated at room temperature for 10 minutes and 
10 then placed at 42 °C for 1 hour. After 1 hour the tubes 

were heated to 90»C for 1 minute to deactivate the MMLVRT 
" and then cooled to room temperature . 

GH 3 LY S^ E CHAIN REACTION ( PCR ) AMPLIFICATION OF RNA FROM 

The following primers were used to amplify, by 
PCR, TSHB cDNA synthesized from RNA transcripts produced by 
the GH 3 cells as a result of the HR plasmids activating the 
endogenous TSHB gene by homologous recombination. 
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35 



primer 

^fH65 AGTATATGATGTACGTGGACAGG 
TSHB3 CACTTGCCACACTTGCAGCTCAGG 



Figure 14 shows the regions of the TSHB gene to 
25 which each primer corresponds. 

PCR REACTION CONDITIONS 

All PCR reactions were performed in the Ericomp 
Twinblock thermocycler. if PCR amplification was to be run 
on cDNA made by method 2, 40 M l of additional reaction mix 
was directly added to the 10 M l of the cDNA reaction 
bringing the total volume up to 50 nl. The final 
concentrations of reagents in the 50 /xl were 70 mM Tris HC1 
pH 8.8, 40 mM KC1, 0.1% Triton X- 100, 2.25 units Tag 
polymerase (Pharmacia), 0.2 micromolar (W) each primer, 
200 /iM each dNTP, and 0.8 mM MgCl 2 . 
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If PCR was to be performed on cDNA made by method 
1 above/ 5 to 10 /xl of the resuspended cDNA was added to 40 
to 45 pi containing final concentrations of the following: 
70 mM Tris.HCl pH 8-8, 40 mM KC1, 0,1% Triton X-100, 2.25 
5 units Taq polymerase, 0.2 /iM each primer, 200 /iM each dNTP, 
and 0 . 8 mM MgCl 2 . 

The reactions were then subjected to the 
following PCR cycles. 

1 minute at 94 °C. 
10 30 seconds at 55 °C. 

2 minutes at 72 °C. 

The above cycle was repeated 30 to 40 times. 10 
pi of each reaction mix was run on a 6% polyacrylamide gel 
and screened for the presence of a 247 bp PCR fragment 
15 which would indicate the presence of the properly spliced 
mRNA for TSHB. 

PCR RESULTS FOR AMPLIFICATION OF TSHB RNA FROM GH 3 CELLS 
AND RAT PITUITARY GLAND TOTAL RNA 

2Q To determine whether GH 3 cells normally 

synthesize TSHB RNA, cDNA from untransf ected GH 3 cells as 
well as cDNA from rat pituitary glands was subjected to the 
above PCR reaction conditions. The correct 247 bp band 
indicative of the presence of TSHB mRNA was visible in the 

25 positive control of the rat pituitary gland sample but no 

band was visualized from the GH 3 cell total RNA sample even 
after 60 cycles (data not shown). 

TRANS FECT I ON RESULTS 
30 The number of G418 resistant colonies present on 

the 10 cm dishes were tabulated between 14 and 21 days 
after addition of G418 to the media. 
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Trans f eet* cm Method 



Colonize re> r 1Q m rH oK 

. pRSVCATNKO PRSVCATWKOTSHB3 -«> vr a i 

Apal cut Aat2 gii+- 
Calcium phosphate 1 43 13 2g 

21 58 
1295 415 
1051 723 



Calcium phosphate 2 
Electroporation 1 
Electroporation 2 



Total RNA was harvested from the colony pools 
contained in the 24 well plates as described above. cDNA 
was made from these RNA preps and subjected to PGR 

Ts^r 011 ', t number ° f positive coi ° nies ™*»* 

TSHB mRNA was determined by the presence of a 247 bp 
fragment as visuali 2 ed on a polyacrylamide gel. Each of 
•the pools screened contained between 10 and 70 colonies. 

Zl TTT* nUaber ° f COl ° nieS ^ P ° 01 Per ^ansf ection 
was used to approximate the number of G418 resistant GH 3 

cell clones in which TSHB gene transcription was activated. 

If a pool tested positive, it was assumed that this 

represented one positive colony present in that particular 



P^TNEO ^^^^ m^P^ 

60 0 
PRSVCATNEOTSHB3 — 5XBA1 aqa-> 
(Aat2 digested) 942 3 

PRSVCATNEOTSHB3-5XBA1 o Rftn 

(Apal digested) 80 6 

These results demonstrate the successful 
activation of the normally transcriptionally silent TSHB 
gene by the method of the present invention. While the 

TZlT COl ° ni r ^ ^ POS±tiVe f ° r TSHfi iption 
xs small compared to the number of colonies that are G418 

resxstant (approximately one out of every i 0 3 G 418 



WO 91/09955 



PCT/US90/07642 



resistant colonies) , this result is generally consistent 
with rates reported for other homologous recombination 
experiments (Michael Kriegler, Gene Transfer a nd Expression 
A Laboratory Manual . Stockton Press, New York, NY (1990) , 
5 pp. 56 - 60) . It has been generally observed that the 
homologous recombination rate seems to be proportional to 
the rate of transcription of the targeted gene (M. Frohman 
and G. Martin, Cell . 56:145 (1989); S. L. Mansour et al, 
Nature, 33 6:348 (1988)). It should be noted that the rate 

10 which has been demonstrated is three orders of magnitude 
higher than what might be expected for random mutation 
turning oh the TSH6 gene . 

To ensure that the results for each colony pool 
were reproducible and that the activation of RNA 

15 transcription was stable, colony pools previously frozen 
away corresponding to pools which tested positive in the 
first screening were thawed and expanded in culture. The 
freshly thawed GH 3 positive pools were seeded in T 25 
tissue culture flasks and expanded until the cells reached 

20 70% to 80% confluence. 80,000 cells were then plated in 24 
well plates from each flask and grown until they were 50% 
to 70% confluent. RNA was extracted from the cells, 
converted into cDNA , and screened once again for the 
presence of TSHB RNA by running 10 /il of each PCR reaction 

25 on a 6% polyacrylamide gel. Figure 15 shows the results 
of representative PCR reactions from the second screening 
as visualized on a polyacrylamide gel by ethidium bromide 
staining and fluorescence. Lanes 1, 2, and 3 contain the 
PCR reactions run on cDNA from GH 3 cells which had been 

30 trahsfected by pRSVCATNEO. pRSVCATNEO contains no regions 
of homology to TSHB and thus is not capable of activating 
the TSH6 gene by homologous recombination. As can be seen 
on the gel in figure 15, there are no bands corresponding 
to 247 bp in those lanes indicating that the TSHB gene is 

35 hot activated. Lane 6 also contains a negative control. 

In that lane three pools were combined from samples of GH 3 
cells which had been transfected with pRSVCATNEOTSHB3-5XbaI 
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(Apal cut) but which were negative for transcription of the 
TSHfi gene on the first screening. The absence of the 247 
bp fragment in lane 6 demonstrates that the presence of the 
transfected pRSVCATNE0TSHB3-5XbaI (Apal cut) plasmid 
integrated randomly in the genome is not capable of 
producing the 247 bp TSHB PGR fragment. Lanes 7, 8 , 9, and 
10 xnclude PGR reactions run on cDNA made from total RNA 
harvested from rat pituitary glands in quantities per 
reaction of 25 nanograms, 100 nanograms, 200 nanograms, and 
400 nanograms, respectively. The presence in these lanes 
of the expected 247 bp band, produced from cDNA prepared 
from a rat tissue which normally expresses TSHfi, showed 
that the PCR reaction conditions were correctly 
^ optimized and that the PCR band obtained in lanes 4 and 5 
containing the homologous recombination TSHB positives is 
of the correct size. Two pools transfected with 
pRSVCATNEOTSHB3-5Xbal (Apal cut) which were positive in the 
first screening, Apal-107 in lane 4 and ApaI-136 in lane 5 
2o once again tested positive for TSHB gene activation as 

demonstrated by the presence of the correct TSHB pgr band 
amplified from cDNA made from the total RNA extracts from 
those pools proving that transcription of TSHB gene has 
been stably activated. The presence of bands at 247 bp in 
25 lanes 4 and 5 containing rna from previous positives Apal- 
107 and ApaI-136 and the absence of bands in the negative 
controls of pRSVCATNEO transfected GH 3 cells in lanes i - 3 
and the P RSVCATNEOTSHB3-5XbaI (Apal cut) negatives in lane 
6 demonstrated that the production of TSHB RNA in a cell 
3Q line that does not normally produce that RNA has been 
stably turned on by homologous recombination. 

The present invention is not limited to the cell 
line that is described herein. All cell lines have genetic 
information which is normally silent or inert. Most are 
35 able to express only certain genes. However, a normally 
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transcriptionally silent or inert gene of any such cell 
line can be activated to express the gene product in 
accordance with the present invention and any gene of the 
genome may have its expression characteristics modified in 
5 accordance with the present invention. Even previously 

transf ormed cell lines can be used as long as the previous 
transformation did not disrupt the gene of interest. The 
source of the cell line is not important. The cell line 
may be animal or plant, primary, continuous or immortal. 
10 Of course, it is desirable that any such cell line be 
stable and immortal so that after treatment with the 
technique in accordance with the present invention, 
expression can be commercialized. Cloned microorganisms, 
whether prokar y otic or eukaryotic, may also be treated by 
15 the technique of the present invention. 

While the present invention has been preferably 
described with respect to the expression of a normally 
transcriptionally silent or inert gene, the technique of 
the present invention is also applicable to the 
20 modification of the expression characteristics of a gene 
which is naturally expressed in the host cell line. For 
example, if it is desired to render the expression of a 
gene dependent upon culture conditions or the like so that 
expression can be turned on and off at will, an appropriate 
25 DNA regulatory segment, such as a regulatable promoter, can 
be inserted which imparts such characteristics, such as 
repressibility or inducibility. For example, if it is 
known that the cell type contains nuclear steroid 
receptors, such as estrogen, testosterone or 
3 0 glucocorticoid, or thyroxin receptors, one could use the 

steroid or thyroxin response elements as region F. Such a 
response element is any DNA which binds such receptor to 
elicit a positive response relative to transcription. Even 
if a cell is not naturally responsive to glucocorticoids, 
35 for example, a piece of DNA which encodes the 

glucocorticoid receptor could be added to the construct, or 
otherwise inserted somewhere in the genome, so as to make 
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the cell responsive to glucocorticoids. The use of a 
regulatable promoter could be desirable whether or not the 
gene of interest is normally transcriptionally silent 
Other kinds of regulation can also be obtained by targeting 
the appropriate DNA regulatory segment to the exact 
position of interest by means of the process of the present 
invention. 

Thus, while stimulation of expression of normally 
transcriptionally silent genes is the preferred application 
of the present invention, in its broadest sense it is 
applicable to the modification of expression 

characteristics of any gene endogenous to the host cell 
line. 

The specific technique of homologous 
recombination is not, per se, a novel part of the present 

skill ; h . SUCh t6ChniqUeS m -d those of ordinary 

skill m this art will understand that any such technique 

taroet USed /L the inVenti ° n ^ * S * 

targeting of the DNA regulatory sequence to the desired 

location with respect to the gene of interest, while a 
preferred technique is disclosed, using a linearized 
construct with two homologous regions on either end of the 
sequences to be inserted, any other technique which will 
accomplish this function, as, for example, by using 
circular constructs, is also intended to be comprehended by 
the present invention. The critical feature of the present 
invention is the use of homologous recombination techniques 
to insert a DNA regulatory sequence which causes 
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modification of expression characteristics in the cell line 
or microorganism being used, operatively linked with a gene 
in the genome of the cell line, preferably one which is 
normally transcriptionally silent, or to insert an 
5 amplif iable sequence, without a regulatory sequence, 

sufficiently near a gene in the genome of the cell line 
which already transcribes as to cause amplification of such 
gene upon amplification of the amplif iable sequence. It is 
not absolutely necessary that a selectable marker also be 

10 included* Selection can be based solely on detection of 

the gene product of interest or mRNAs in the media or cells 
following insertion of the DNA construct. Furthermore, in 
the embodiment in which a regulatory sequence is being 
inserted, amplification, while desired, is not critical for 

15 operability. The same is true for the negative selection 
gene which makes the screening process easier, but is not 
critical for the success of the invention. Thus, the basic 
embodiment requires only insertion of the DNA regulatory 
segment or the amplif iable segment in the specific position 

20 desired. However, the addition of positive and/or negative 
selectable marker genes for use in the selection technique 
is preferred, as is the addition of an amplif iable gene in 
the embodiment in which a regulatory segment is being 
added. 

25 The term "modification of expression 11 as used 

throughout the present specification and claims, is hereby 
defined as excluding termination of expression by inserting 
by homologous recombination a mutation, deletion, stop 
codon, or other nucleotide sequence, including an entire 

3 0 gene, into the gene of interest, so as to prevent the 

product of interest from being expressed. The prior art 
teaches the use of homologous recombination to insert 
specific mutations and the expression of a cell product may 
have inherently been terminated by means thereof (see, for 

35 example, Schwartzberg et al, PNAS (USA), 87:3210-3214 
(1990)). The present invention is not intended to 
encompass such a procedure. In the present invention the 
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"modification of expression" is accomplished by means of 
inserting regulatory and/or amplification regions at a 
specific desired location by means of homologous 
recombination. The preferred modifications are those which 
activate and/or enhance expression of the product of 
interest. 

Whenever the present specification uses the 
Phrase that a DNA regulatory segment is "operatively linked 
wxth" a gene, such terminology is intended to mean that the 
DNA regulatory segment is so disposed with respect to the 
gene of interest that transcription of such gene is 
regulated by that DNA regulatory segment. The regulatory 
segment is preferably upstream of the gene, but may be 
downstream or within the gene, provided that it operates to 
regulate expression of the gene in some way. The DNA 
regulatory segment may be a promoter, terminator, operator 
enhancer, silencer, attenuator, or the like, or any 
combination thereof. 
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used i ™ eneVer the terns upstream" or "downstream" are 
used x„ the present specification and claims, this is 
intended to mean. in the 5- -direction or the 3- -direction 
respectively, relative to the coding strand of the gene of 
interest. OI 

The foregoing description of the specific 
embodiments so fully reveals the general nature of the 
invention that others can readily modify and/or adapt such 
specxfxc embodiments for various applications without 

tZTtZ'T ^ 9eneriC COnCGPt - Any SUCh stations 
and modxf xcatxons are intended to be embraced within the 

meanxng and range of equivalents of the disclosed 

embodiments it is to be understood that the phraseology 

and termxnology employed herein are for the purpose of 

descrxption and not of limitation. 
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WHAT IS CLAIMED IS: 

1. A method of activating a normally 
transcriptionally silent gene within the genome of a cell 
line or microorganism so as to enable said cell line or 

5 microorganism to express the gene product of said gene, 
comprising inserting a DNA construct into said genome by 
homologous recombination , said DNA construct comprising a 
DNA regulatory segment capable of stimulating expression of 
said gene when operatively linked thereto and a DNA 
10 targeting segment homologous to a region of said genome 

within or proximal to said gene/ wherein said construct is 
inserted such that said regulatory segment is operatively 
linked to said gene of interest. 

2. A method in accordance with claim 1, 21 , or 
15 22, wherein said DNA construct comprises two DNA targeting 

segments, each homologous to a region of said genome within 
or proximate to said gene, one of said targeting segments 
being upstream of said regulatory segment and the other of 
said targeting segments being downstream of said regulatory 
20 segment. 

3. A method in accordance with claim 1, 2, or 
21, wherein said DNA construct additionally comprises at 
least one expressible selectable marker gene disposed so as 
to be inserted with said regulatory segment. 

25 4. A method in accordance with claim 1, 2, 3, 

21, 22, or 23, wherein said DNA construct additionally 
comprises a negative selectable marker gene disposed with 
respect to said targeting segment so as not to be inserted 
when said construct is properly inserted by homologous 

3 0 recombination, whereby said negative selectable marker is 
not expressed in cells in which said DNA construct is 
properly inserted. 

5. A method in accordance with claim 1, 2,3, 4, 
or 21, wherein said DNA construct additionally comprises an 

35 expressible amplifiable gene disposed so as to be inserted 
with said regulatory segment. 
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6. 



A method }n accordance with claim 1, 2, 3 4 
5, 21, 22, or 23, wherein said cell line or microorganism ' 
is a eukaryotic cell line. 

7. a method in accordance with claim 6, wherein 
5 said cell line or microorganism is an animal cell line. 

v, V- A ln accordance ^th claim 6, wherein 

said cell line or microorganism is a mammalian cell line. 

9. A method in accordance with claim 6, wherein 
said cell line or microorganism is a plant cell line. 
10 ^. . 10 ' A m€ *hod in accordance with claim 3, and 
additionally for causing expression of said gene product 
further including the steps of,, following said inserting 

selecting clones of said cell line or 
microorganism which express the product of said selectable 
marker gene; 

su^- • ™ ltivatln * the selected clones under conditions 
sufficient to permit expression of said gene product; and 
collecting said gene product. 

11. A method in accordance with claim 10 
wherein said selectable marker gene is the neomycin' 

thosfcl Ce ^ ^ Sel6Ctin * Ste * comprises selecting 

those clones having neomycin resistance. 

12 . A method in accordance with claim 10 or li 
wherein said DNA construct additionally comprises a 
negative selectable marker gene disposed with respect to 
said targeting segment so as not to be inserted when said 
construct is properly inserted by homologous recombination, 
whereby said negative selectable marker is not expressed in 

S s :? ^ 1Ch Sa±d ° NA ~™truot is properly inserted, and 

tl \ „ ' P fUrther inClUd6S -luting those clones 

which do not express said negative selectable marker gene. 

13. A method in accordance with claim 12 

llZT n v ald Sel6Ctable * the'nerpes 

Simplex Virus thymidine kinase gene and said selecting step 
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includes selecting those clones which survive exposure to a 
media that kills cells which express said gene. 

14. A genome having a DNA regulatory segment 
operatively linked with a naturally occurring gene at an 

5 insertion site characterized by a predetermined DNA 

sequence, said DNA regulatory segment not being naturally 
occurring at said location in the genome. 

15. A cell line or microorganism capable of 
expressing c* gene product by a normally transcriptionally 

10 silent gene within the genome of said cell line or 

microorganism, said genome having inserted therein a DNA 
regulatory segment operatively linked with said normally 
transcriptionally silent gene, said DNA regulatory segment 
being capable of promoting the expression of a gene product 

15 by said cell line or microorganism. 

16. A cell line or microorganism in accordance 
with claim ^5 or 25, wherein said DNA regulatory segment is 
one which is capable of promoting the expression of a gene 
product normally expressed by said cell line or 

20 microorganism. 

17 • A cell line or microorganism in accordance 
with claim 16, wherein the inserted DNA regulatory segment 
is pairt of a DNA construct comprising said DNA regulatory 
segment and at least one selectable marker gene. 

25 18 • A cell line or microorganism in accordance 

with claim 17, wherein said DNA construct additionally 
comprises an amplifiable gene. 

19. A method of obtaining a gene product from a 
cell line or microorganism, comprising culturing a 

3 0 differentiated cell line or microorganism in accordance 
with claim 15-18 or 24-26 under conditions which permit 
expression of said gene product, and collecting said gene 
product. 
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20. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising a 
DNA regulatory segment capable of modifying the expression 
characteristics of genes in the host cell line or 
microorganism when operatively linked thereto and a DNA 
targeting segment homologous to a region of the genome of » 
preselected gene within the host cell line or 
microorganism . 



10 



21. A method of modifying the expression 
characteristics of a gene within the genome of a cell line 
or mxcroorganism, comprising inserting a DNA construct into 
saxd genome by homologous recombination, said DNA construct 
comprxsxng a DNA regulatory segment capable of modifying 
the expression characteristics of said gene when 
15 operatively linked thereto, as compared to its existing DNA 
regulatory segment, and a DNA targeting segment homologous 
to a region of said genome within or proximal to said gene 
wherein said construct is inserted such that said ' 

20 In^r S69ment ^ ° PeratlVel * ^ ^ said gene of 

22. A method of modifying the expression 
characteristics of a gene within the genome of a cell line 
or mxcroorganism, comprising inserting a DNA construct into 
saxd genome by homologous recombination, said DNA construct 
comprxsxng an expressible, amplifiable gene capable of 
amplifying said gene when inserted in sufficiently close 
proxxmxty thereto, and a DNA targeting segment homologous 
to a regxon of said genome within or proximal to said gene 
wherexn saxd construct is inserted such that said 
amplifiable gene is in sufficiently close proximity to said 

Lplifxabl ^ 031156 afflPlificati - hereof when said 

amplxfxable gene is amplified. 

23. A method in accordance with claim 22 
wherein said DNA construct additionally comprises * least 
one expressible selectable marker gene disposed so as to be 
inserted wxth said expressible, amplifiable gene. 



25 



35 
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24. A cell line or microorganism capable of 
enhanced expression of a gene product compared to the cell 
line or microorganism from which it is derived, said gene 
product beipg the expression product of an endogenous gene 
5 within the genome of said cell, said genome having inserted 
therein in an operative manner, at or near said endogenous 
gene, an exogenous DNA regulatory segment and/ or 
amplifiable gene capable of enhancing the expression of 
said gene product by said cell line or microorganism. 
10 25. A cell line or microorganism in accordance 

with claim 24, wherein said exogenous DNA regulatory 
segment and/or amplifiable gene is an exogenous DNA 
regulatory segment . 

26. A cell line or microorganism in accordance 
15 with claim 24, wherein said exogenous DNA regulatory 

segment and/or amplifiable gene is an exogenous amplifiable 
gene. 

27. A DNA construct for insertion into a 
predetermined host cell line or microorganism, comprising 

20 an expressible, amplifiable gene capable of amplifying a 
gene in the host cell line or microorganism when inserted 
in sufficiently close proximity thereto, and a DNA 
targeting segment homologous to a region of the genome of a 
preselected gene within the host cell line or 
microorganism. 

28. A method in accordance with claims 1, 2, 3, 
4, 5, 21, 22, or 23, wherein said cell iine or 
microorganism is a microorganism. 



25 
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LOCATION OF PRIMERS FOR PCR AMPLIFICATION OF TSH BETA 

5' 

g gcocgcctctgoatgt ggaaa g gococtt at gogctctgtggtctttccctctgattt 
a g C ATG AATGCT GTC6TT CT C T T TTC CGTGCT TTTCGCTCTTGCTTGTGGGC^AGTGT 

5' TSHB5 3' 

A GTATATGATGTACGTGGACAGG 
C AT CGTTTTGTAT TCCC ACT G A G T ATATGATGTACGTGGACAGGAGAG^GTGCCTAC 

**** ** ****** * **** *** XXXXXXXXXXXXXXXXX 

TG C C TG ACCATC AACACCACCATCTGC GCTGGG T ATTGT ATG ACACGG gtatgttggt 

X XXX XXXXXXXXXXXXXXXXX K*X*XXXXXXXXXXXX*XXXXXX*XXX 

cactgcgtttctt t t ogct gta o a 1 1 g tocagg f c taaagtt g tctgttaatattt tag 
aaoggaagtgggataaatcata gt ctcctctt tgggaagccaagcacactgctttcaga 
a 1 1 ataattatgt cattc t a cac a g aaaaagta cagatacat t g taacagtttacccta 

aagtgtttgttctgctcaatgg t a g a tg agaagaaagtgtccttttttgtctctgaggg 
g t taag tgtagat gtgtggg to a c a gagetcaggagtcctt taagatcatcaggaaaca 
aagggatat .tagtcattctp 1 1 a c ac taagttgcatgcagtttatcatgttaagatctc 
t t 1 1 c t tccacag GATATCAATG GCAAACTGTTTCTTCCCAAGTACGCACTCTC7CAG 

X XX X X X X X X X X X X XXXXXXX XXXXXXXXXXX XXXXXXXXXXXXX* 

G ATG TCTGTACATACAGAG ACT TC ACCTACA3AACG<?TGGAMTACCGGGATGC(XACA 
* *************x*****x*x*x***xxxxxxxx*^x*xxxx 

C C ATG TTGCTCCTT ATT TC TC C T A C CCCGTTGCC CTGAGCTGCAAGTGTGGCAAGTGTA 
x x xxx xx xxxxxxxxxx x x x x x x x x ***xxx*xxx xxxxxxxxxxxxxxxxxxxxxx 

G GACTCGACGTTCACACCGTTCAC 
3' TSHB3 5' 

A C AC TG ACT AC AG CGACTGTACACACG AGGCTGT C AAAACC AA CTACTGCACCAAGCCA 
C AG AC ATTCTATCTGGGGG GATTTTCTGGTTAACTGTAATGGCAATG CAATCTG GTTAA 
A T G TG TTTACCTG GA ATAG A ACTA ATAAAATATC ATTG AT atg tct tgcctgc cattt 
a a tccataggcacatccacaaggcat toga gage ttocacaactttagaagcagaggcg 

EXONS 2 AND 3 ARE IN CAPITAL LETTERS 
247 BP AMPLIFIED FRAGMENT UNDERLINED BY * 
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